Engineering posts about Transfer Learning
Curated summaries and key learnings for engineers working with Transfer Learning.
Making User-Sequence Data More Cost-Efficient, Faster, and Easier to Use
This article discusses the redesign of a user-sequence platform aimed at improving the efficiency, speed, and usability of user data for machine learning applications. It addresses the challenges...
The JavaScript AI Build-a-thon Season 2 starts today!
The JavaScript AI Build-a-thon is a comprehensive program aimed at bridging the gap in AI development for JavaScript and TypeScript developers. Spanning four weeks, the event includes self-paced...
Reel Friends: Building Social Discovery that Scales to Billions
In the Meta Tech Podcast episode featuring Pascal Hartig, the engineering intricacies behind the 'Friend Bubbles' feature of Facebook Reels are explored. The discussion highlights the evolution of...
What the design-to-code loop unlocks
The article explores the evolving relationship between design and code facilitated by AI technologies, particularly within the Figma platform. It emphasizes how AI is transforming traditional...
Addressing HR's widening capacity gap with AI
The article outlines the pressing challenges faced by HR departments in the wake of increasing demands and limited resources, highlighting the widening capacity gap exacerbated by post-pandemic...
Text-Conditional JEPA for Learning Semantically Rich Visual Representations
The article introduces Text-Conditional JEPA (TC-JEPA), a new framework for learning semantically rich visual representations by leveraging image captions to modulate predicted features. This...
Bootstrapping Sign Language Annotations with Sign Language Models
The article presents a novel approach to enhance sign language annotation through machine learning techniques. It outlines the limitations of current datasets and introduces a pseudo-annotation...
STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows
The article introduces STARFlow-V, a novel video generative model that leverages normalizing flows for end-to-end likelihood-based generation. Unlike conventional diffusion-based models, STARFlow-V...
From Clicks to Conversions: Architecting Shopping Conversion Candidate Generation at Pinterest
The article discusses Pinterest's development of a shopping conversion candidate generation model aimed at optimizing offsite conversion events, which are typically sparse and noisy. It details the...
AI App Development: Guide To Building AI-Powered Apps
This article serves as a detailed guide for developers looking to build AI-powered applications, emphasizing the importance of structured planning and execution. It outlines the phases of AI app...
A Practical Guide to LLM Fine Tuning
This article serves as a practical guide for ML engineers and AI practitioners focused on fine-tuning large language models (LLMs) for specific tasks. It outlines the entire lifecycle of LLM...
How to transform document activation workflows with Genie and Agent Bricks
The article outlines the challenges organizations face in managing document workflows, emphasizing the need for a unified data foundation to leverage AI effectively. It introduces Databricks'...
Why Your Agents Can’t Read Enterprise Documents — and How to Fix It
The article highlights the limitations of current AI agents in processing enterprise documents, emphasizing that the primary challenge lies in reading rather than reasoning. It introduces Document...
Capacity Efficiency at Meta: How Unified AI Agents Optimize Performance at Hyperscale
The article outlines Meta's innovative Capacity Efficiency Program, which leverages a unified AI agent platform to enhance performance optimization across its infrastructure. By automating the...
From bytecode to bytes: automated magic packet generation
The article discusses the challenges of reverse-engineering Linux malware that utilizes Berkeley Packet Filter (BPF) socket programs. It introduces a novel approach using symbolic execution with the...
Advancing semantic search for millions of Rovo users
The article outlines Rovo's innovative approach to semantic search, which enhances the ability of users to find relevant information across various tools like Jira and Confluence. By moving beyond...
The Hidden Cost of Complex AI Platforms: Why Developer Experience Matters
The article explores the often-overlooked costs associated with complex AI platforms, particularly emphasizing the developer experience. It highlights how fragmented workflows and unclear...
Beyond Real Data: Synthetic Data through the Lens of Regularization
The paper discusses the role of synthetic data in improving generalization in machine learning, particularly when real data is limited. It introduces a learning-theoretic framework that quantifies...
Revisiting the Scaling Properties of Downstream Metrics in Large Language Model Training
This paper investigates the scaling properties of downstream metrics in the training of Large Language Models (LLMs). It challenges the traditional reliance on proxy metrics, proposing a direct...
Tevogen Bio’s Journey to Streamlining Life-Saving Therapies
Tevogen Bio is revolutionizing the drug discovery process through its ExacTcell platform, which utilizes proprietary PredicTcell AI models to automate and expedite the traditionally lengthy and...